A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data
نویسندگان
چکیده
منابع مشابه
A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data
Similarity or distance measures are core components used by distance-based clustering algorithms to cluster similar data points into the same clusters, while dissimilar or distant data points are placed into different clusters. The performance of similarity measures is mostly addressed in two or three-dimensional spaces, beyond which, to the best of our knowledge, there is no empirical study th...
متن کاملA Geometric View of Similarity Measures in Data Mining
The main objective of data mining is to acquire information from a set of data for prospect applications using a measure. The concerning issue is that one often has to deal with large scale data. Several dimensionality reduction techniques like various feature extraction methods have been developed to resolve the issue. However, the geometric view of the applied measure, as an additional consid...
متن کاملA Comparative Study of String Dissimilarity Measures in Structural Clustering
This paper addresses structural clustering by stressing the distinction between string matching and structural resemblance. The hierarchical agglomerative clustering concept and a partitional approach are explored in a comparative study of several dissimilarity measures: minimum code length based measures; dissimilarity based on the concept of reduction in grammatical complexity; string matchin...
متن کاملa study on rate making and required reserves determination in reinsurance market: a simulation
reinsurance is widely recognized as an important instrument in the capital management of an insurance company as well as its risk management tool. this thesis is intended to determine premium rates for different types of reinsurance policies. also, given the fact that the reinsurance coverage of every company depends upon its reserves, so different types of reserves and the method of their calc...
Neural Gas Clustering for Dissimilarity Data with Continuous Prototypes
Prototype based neural clustering or data mining methods such as the self-organizing map or neural gas constitute intuitive and powerful machine learning tools for a variety of application areas. However, the classical methods are restricted to data embedded in a real vector space and have only limited applicability to noneuclidean data as occurs in, for example, biomedical or symbolic fields. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLOS ONE
سال: 2015
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0144059